test: aac add test coverage for `ParseError` variants #6631

federico-stacks · 2025-10-28T10:36:15Z

Description (UPDATED)

This PR adds consensus test coverage for the ParseError and related ParseErrors enum variants (alias ParseErrorKind) in preparation for the upcoming refactor activities.

What's done

Added test coverage for the ParseErrors variants in every scenario where it is functionally feasible.
Added test documentation pointing to the variant under test.
Added an "in-code" report that states, for each variant, whether it is covered by a test and, if not, why (see the variant_coverage_report function trick).
Added a set of unit tests for the build_ast function to support writing the consensus tests. Also added a cost tracker facility that allows using the LimitedCostTracker with the real cost functions.

Prevented the test process from blowing up (due to console output) when the IllegalASCIIString(String) variant is converted to a string and its string field is larger than 1 MB, by truncating the string with an ellipsis (in tests only). The output comes from this info log statement, which prints the Clarity error using its Debug format:

stacks-core/stackslib/src/chainstate/stacks/db/transactions.rs

Lines 1309 to 1312 in 12c8d14

    
    info!(
        "Runtime error in contract analysis for {contract_id}: {other_error:?}";
        "txid" => %tx.txid(),
    );

Fixed CircularReference(Vec<String>) by making the order of the string list deterministic (otherwise the test result would flip-flop because the circular references were listed in random order).
Improved the test harness macros contract_deploy_consensus_test! and contract_call_consensus_test! so that they can now be wrapped in a "native" #[test] function. This lets us add proper Rust documentation to test cases and gives IDE browsing support for the tests (e.g. outline view).

Heads-up

> Stack Depth
A note on how I managed to test VaryExpressionStackDepthTooDeep. It was only possible because of a "gap" in how the stack depth limit is enforced while parsing:

In Parser::parse_node(..), the nesting depth is checked against MAX_NESTING_DEPTH = AST_CALL_STACK_DEPTH_BUFFER + MAX_CALL_STACK_DEPTH + 1 (total = 70), with different rules for lists and tuples. If that check passes,
in VaryStackDepthChecker::check_vary(..) a second check requires the depth to be < AST_CALL_STACK_DEPTH_BUFFER + MAX_CALL_STACK_DEPTH (total = 69), applying the same rule to lists and tuples.

Thanks to this difference of 1 (70 vs 69), I was able to build a test case that passes the first check and fails the second.

Just to make sure I understand: we already know that 64 is the stack limit and that we allow a bit of extra "space", but why do the two checks use different stack limits?
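
To make the gap concrete, here is a minimal sketch of the two limits side by side. It is not the real parser code: the buffer value of 5 is inferred from the totals quoted above (69 - 64), the function names are hypothetical, and the use of a strict `<` in the first check is an assumption.

```rust
// Values taken from the discussion: stack limit 64, totals 70 and 69.
// The buffer of 5 is inferred from those totals, not read from the code.
const MAX_CALL_STACK_DEPTH: u64 = 64;
const AST_CALL_STACK_DEPTH_BUFFER: u64 = 5;
const MAX_NESTING_DEPTH: u64 = AST_CALL_STACK_DEPTH_BUFFER + MAX_CALL_STACK_DEPTH + 1; // 70

/// Hypothetical stand-in for the nesting check done while parsing.
/// Assumption: the check is a strict "less than".
fn parser_accepts(depth: u64) -> bool {
    depth < MAX_NESTING_DEPTH
}

/// Hypothetical stand-in for the second check (depth must be < 69).
fn depth_checker_accepts(depth: u64) -> bool {
    depth < AST_CALL_STACK_DEPTH_BUFFER + MAX_CALL_STACK_DEPTH
}

/// Depths that pass the first check but fail the second one.
fn gap_depths() -> Vec<u64> {
    (0..=MAX_NESTING_DEPTH)
        .filter(|&d| parser_accepts(d) && !depth_checker_accepts(d))
        .collect()
}
```

Under these assumptions, `gap_depths()` contains exactly the depths that only the second check can reject, which is the window the test case relies on.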

> Odd ParseErrors Conversion
I noticed some cases where a ParseErrors (alias ParseErrorKind) is obtained by converting from other error families.
In this example, Value::buff_from(bytes) produces an InterpreterError (alias VmExecutionError), internally caused by a possible CheckErrors (alias CheckErrorKind), which is then remapped to a ParseErrors (alias ParseErrorKind) by the match expression:

stacks-core/clarity/src/vm/ast/parser/v2/mod.rs

Lines 949 to 966 in 12c8d14

    
    Token::Bytes(data) => {
        let mut expr = match hex_bytes(data) {
            Ok(bytes) => match Value::buff_from(bytes) {
                Ok(value) => PreSymbolicExpression::atom_value(value),
                _ => {
                    self.add_diagnostic(
                        ParseErrors::InvalidBuffer,
                        token.span.clone(),
                    )?;
                    PreSymbolicExpression::placeholder(token.token.reproduce())
                }
            },
            Err(_) => {
                self.add_diagnostic(
                    ParseErrors::InvalidBuffer,
                    token.span.clone(),
                )?;
                PreSymbolicExpression::placeholder(token.token.reproduce())

Maybe the Clarity types (in clarity-types/src/types/mod.rs) should have their own error layer, which callers could then convert as appropriate.
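
As a rough illustration of that idea, the sketch below gives the types layer its own small error enum and lets the parser layer decide how to map it. All names here (`ValueError`, the local `ParseErrorKind`, `buff_from`, `parse_buffer`, `MAX_BUFF_LEN`) are hypothetical stand-ins, not the real stacks-core definitions.

```rust
/// Hypothetical error type owned by the types layer.
#[derive(Debug, PartialEq)]
enum ValueError {
    BufferTooLarge { len: usize, max: usize },
}

/// Hypothetical stand-in for the parser's error kind.
#[derive(Debug, PartialEq)]
enum ParseErrorKind {
    InvalidBuffer,
}

/// The caller (here, the parser layer) owns the conversion explicitly.
impl From<ValueError> for ParseErrorKind {
    fn from(err: ValueError) -> Self {
        match err {
            ValueError::BufferTooLarge { .. } => ParseErrorKind::InvalidBuffer,
        }
    }
}

/// Tiny illustrative limit, not the real buffer limit.
const MAX_BUFF_LEN: usize = 4;

/// Types-layer constructor that only knows about its own error type.
fn buff_from(bytes: Vec<u8>) -> Result<Vec<u8>, ValueError> {
    if bytes.len() > MAX_BUFF_LEN {
        Err(ValueError::BufferTooLarge { len: bytes.len(), max: MAX_BUFF_LEN })
    } else {
        Ok(bytes)
    }
}

/// Parser-side use: `?` applies the `From` conversion defined above.
fn parse_buffer(bytes: Vec<u8>) -> Result<Vec<u8>, ParseErrorKind> {
    Ok(buff_from(bytes)?)
}
```

With a layout like this, the types crate would not need to know about interpreter or check errors at all, and each consumer would pick its own mapping.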

> ParseErrors::NameAlreadyUsed rename?
This error variant seems to be used only for trait names, so it could be renamed accordingly (e.g. TraitNameAlreadyUsed).

stacks-core/clarity/src/vm/ast/traits_resolver/mod.rs

Lines 57 to 68 in 12c8d14

    
    DefineFunctions::Trait => {
        if args.len() != 2 {
            return Err(ParseErrors::DefineTraitBadSignature.into());
        }

        match (&args[0].pre_expr, &args[1].pre_expr) {
            (Atom(trait_name), List(trait_definition)) => {
                // Check for collisions
                if contract_ast.referenced_traits.contains_key(trait_name) {
                    return Err(
                        ParseErrors::NameAlreadyUsed(trait_name.to_string()).into()
                    );

> Duplicated MAX_STRING_LEN const
This constant seems to be duplicated in clarity and clarity-types, with different types but the same value:

clarity::vm::ast::parser::v2::MAX_STRING_LEN: usize = 128
clarity-types::representations::MAX_STRING_LEN: u8 = 128
- also re-exported by the clarity crate as clarity::vm::representations::MAX_STRING_LEN

Could we merge the two? Perhaps by changing the usize one to u8, given that the second is the one exposed by clarity-types?
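
One possible shape for the merge, sketched with hypothetical module names: keep a single `u8` definition and widen it losslessly wherever a `usize` is needed. The `name_fits` helper and the `<=` comparison are assumptions made only for the example.

```rust
/// Single definition, as it might live in the types crate (hypothetical layout).
mod types_layer {
    pub const MAX_STRING_LEN: u8 = 128;
}

/// The parser side reuses the constant instead of redefining it as `usize`.
mod parser_layer {
    use super::types_layer::MAX_STRING_LEN;

    /// Hypothetical length check; `usize::from` is a lossless widening of the `u8`.
    pub fn name_fits(name: &str) -> bool {
        name.len() <= usize::from(MAX_STRING_LEN)
    }
}
```

Going from `u8` to `usize` never needs a fallible cast, which is one argument for keeping the narrower type as the single source of truth.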

Possible Follow-ups

Adding more unit tests for the build_ast function. I noticed that unit test coverage for this function is lacking (many parse error variants have no related test cases), so this could be a valuable task.
Regarding the "string ellipsing" of IllegalASCIIString(String), a better approach would be to do it on the ParseErrors enum side, but that would require a custom Debug implementation for the enum. To avoid writing the debug format for every variant, we could use a crate such as derive-debug, which allows overriding the debug format selectively at the variant level. If there is interest, this could be a follow-up.
Refactoring the ParseErrors variants to make the code's intent clearer (unreachable variants, separating parse errors from cost errors, removing the From error translation where possible, etc.). NOTE: it would be better to complete the "aac plowing" phase (with the enum renaming) before starting, to avoid conflict hell.
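
For the "string ellipsing" follow-up, a hand-written Debug implementation could look roughly like the sketch below. The enum is a hypothetical two-variant stand-in (the payload of `NameTooLong` is assumed), and `MAX_DEBUG_CHARS` and `ellipsize` are illustrative names; it shows the manual approach, not the derive-debug crate.

```rust
use std::fmt;

/// Illustrative cap on how many characters are printed.
const MAX_DEBUG_CHARS: usize = 16;

/// Hypothetical stand-in for the real enum, reduced to two variants.
enum ParseErrorKind {
    IllegalASCIIString(String),
    NameTooLong(String),
}

/// Keeps at most `max_chars` characters and marks the cut with "...".
fn ellipsize(s: &str, max_chars: usize) -> String {
    if s.chars().count() <= max_chars {
        s.to_string()
    } else {
        let head: String = s.chars().take(max_chars).collect();
        format!("{head}...")
    }
}

impl fmt::Debug for ParseErrorKind {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            // Only this variant gets the truncated representation.
            Self::IllegalASCIIString(s) => f
                .debug_tuple("IllegalASCIIString")
                .field(&ellipsize(s, MAX_DEBUG_CHARS))
                .finish(),
            // Every other variant has to be spelled out by hand as well.
            Self::NameTooLong(s) => f.debug_tuple("NameTooLong").field(s).finish(),
        }
    }
}
```

The second match arm shows the cost mentioned above: once Debug is written by hand, each variant needs its own arm, which is what a per-variant override would avoid.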

Draft Description (OLD)

This PR adds consensus test coverage for the ParseError enum in preparation for the upcoming refactor of that enum.

The PR is currently a draft to gather early feedback on test organization and macro usage:

Although this is not intended as the final structure, I've started isolating ParseError-related tests in a dedicated module, parse_tests.rs.
To support this, I've exported the contract_deploy_consensus_test! and contract_call_consensus_test! macros, opting for:
- scoped export, to prevent visibility at the crate root level
- import consistency, so the macros can be used without additional imports (i.e., resolving internal macro dependencies using absolute namespaces).
Furthermore, I've added the clarity_types crate as a test dependency for documentation purposes (doc references to ParseError variants). This is just a preliminary attempt, and I'm open to removing it if it's not beneficial (I'm still trying to figure this out myself).
- Currently, it doesn't work properly with the macro, causing IDE warnings and broken hyperlinks (it would work correctly with a standard test function).
- I also noticed that with the macro we lose the Outline view (it is empty).
  
  So, apart from the doc issue, I'm wondering whether it would be convenient in general to have "native" test functions while keeping the macro just for configuring the test body. Something like this:
```
#[test]
fn test_parse_error_lexer() {
    contract_deploy_consensus_test!(
        parse_error__import_trait_bad_signature,
        contract_name: "my-contract",
        contract_code: &{"(use-trait)"},
    );
}
```
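
To illustrate the "macro only configures the test body" idea in isolation, here is a small self-contained sketch. `deploy_test_body!` and `run_deploy` are hypothetical stand-ins for the real harness, which is not reproduced here; in a test module the wrapping function would carry `#[test]`.

```rust
/// Hypothetical stand-in for the harness entry point.
fn run_deploy(contract_name: &str, contract_code: &str) -> String {
    format!("deployed {contract_name} ({} bytes)", contract_code.len())
}

/// Expands to an expression (the test body) rather than a whole test function,
/// so the surrounding function stays visible to rustdoc and to the IDE outline.
macro_rules! deploy_test_body {
    (contract_name: $name:expr, contract_code: $code:expr $(,)?) => {
        run_deploy($name, $code)
    };
}

/// ParserError: documented like any ordinary function.
fn parse_error_import_trait_bad_signature() -> String {
    deploy_test_body!(
        contract_name: "my-contract",
        contract_code: "(use-trait)",
    )
}
```

Because the function is a plain item, doc comments and links on it behave as they would for any other function.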

Applicable issues

fixes AAC Testing: Add test coverage for ParseError enum #6627

Additional info (benefits, drawbacks, caveats)

Checklist

Test coverage for new or modified code paths
Changelog is updated
Required documentation changes (e.g., docs/rpc/openapi.yaml and rpc-endpoints.md for v2 endpoints, event-dispatcher.md for new events)
New clarity functions have corresponding PR in clarity-benchmarking repo

stackslib/src/chainstate/tests/consensus.rs

jacinta-stacks · 2025-10-28T15:08:49Z

stackslib/src/chainstate/tests/parse_tests.rs

+
+/// ParserError: [`ParseErrors::Lexer`]
+/// Caused by: unknown symbol
+/// Outcome: block accepted.


I like your comments. Clear, concise, easy to follow/reason about.

jacinta-stacks · 2025-10-28T15:13:21Z

I actually think it might be slightly easier to reason about the test name/be able to find tests if we did something like

#[test]
fn test_parse_error_lexer() {
    contract_deploy_consensus_test!(
        function_name!(),
        contract_name: "my-contract",
        contract_code: &{"(use-trait)"},
    );
}

since when I initially looked at contract_deploy_consensus_test! macro it took me a bit to realize it was a #[test]. But I am not strongly one way or the other. Whichever makes it easier for you to reuse these functions in your test, go for it. If you need help fixing your errors, let me know. I will give it a look.

federico-stacks · 2025-10-28T16:21:25Z

Considering we have a macro, we should be able to use function_name!() internally.
Otherwise, if we also expose function_name!() as in your example, we could even replace the macro with a function.
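
For reference, one common way to write such a macro is sketched below. This `function_name!` is a hypothetical local definition, not the one from the discussion, and it relies on `std::any::type_name`, whose output is documented as best-effort and not guaranteed to be stable.

```rust
/// Hypothetical `function_name!`: yields the name of the enclosing function.
macro_rules! function_name {
    () => {{
        // A nested item whose type name embeds the path of the enclosing function.
        fn marker() {}
        fn type_name_of<T>(_: T) -> &'static str {
            std::any::type_name::<T>()
        }
        // Typically something like "my_crate::tests::my_test::marker".
        let full = type_name_of(marker);
        let path = full.strip_suffix("::marker").unwrap_or(full);
        // Keep only the last path segment, i.e. the function's own name.
        path.rsplit("::").next().unwrap_or(path)
    }};
}

fn test_parse_error_lexer() -> &'static str {
    function_name!()
}
```

A macro like this can be invoked from inside another macro's expansion, so the test name would not need to be passed in by hand.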

federico-stacks · 2025-10-30T10:55:13Z

Based on the feedback, I've updated the consensus macros as proposed in the PR description.
It seems to work nicely (with documentation, and I also added a should_panic test for an unreachable error variant).

By the way, adding more tests exposed a new issue related to the vm_error field. Basically, this test:

#[test]
fn test_circular_reference() {
    contract_deploy_consensus_test!(
        contract_name: "my-contract",
        contract_code: &{"
            (define-constant my-a my-b)
            (define-constant my-b my-a)
        "},
    );
}

produces a non-deterministic vm_error description. The possible outputs are:

detected interdependent functions (my-a, my-b)
detected interdependent functions (my-b, my-a)

where the elements listed in the parentheses can appear in either order, flip-flopping from run to run.

codecov · 2025-10-30T11:19:31Z

Codecov Report

❌ Patch coverage is 87.39496% with 15 lines in your changes missing coverage. Please review.
✅ Project coverage is 74.38%. Comparing base (f8d02cd) to head (298959a).

Files with missing lines	Patch %	Lines
stackslib/src/chainstate/tests/parse_tests.rs	85.98%	15 Missing ⚠️

❌ Your project check has failed because the head coverage (74.38%) is below the target coverage (80.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #6631      +/-   ##
===========================================
+ Coverage    64.85%   74.38%   +9.52%     
===========================================
  Files          574      575       +1     
  Lines       355039   355145     +106     
===========================================
+ Hits        230278   264188   +33910     
+ Misses      124761    90957   -33804

Files with missing lines	Coverage Δ
stackslib/src/chainstate/tests/consensus.rs	`95.00% <100.00%> (+9.78%)`	⬆️
stackslib/src/chainstate/tests/mod.rs	`70.08% <ø> (ø)`
stackslib/src/chainstate/tests/parse_tests.rs	`85.98% <85.98%> (ø)`

... and 373 files with indirect coverage changes

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

francesco-stacks

Good changes! Definitely fine with the named tests.

PS: I am sure you have already noticed, but some unit tests are currently failing.

stackslib/src/chainstate/tests/parse_tests.rs

...tests/snapshots/blockstack_lib__chainstate__tests__parse_tests__trait_reference_unknown.snap

stackslib/src/chainstate/tests/parse_tests.rs

federico-stacks · 2025-10-30T16:54:11Z

Yeah! This is because the vm_error for test_circular_reference is not deterministic, as mentioned in this comment: #6631 (comment)

federico-stacks · 2025-10-31T12:17:19Z

I opted to handle the non-deterministic vm_error of test_circular_reference by forcing CircularReference to be emitted with a deterministic list.
See commit: a81db5d
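
The general technique can be sketched as follows: collect the names from an unordered container and sort them before building the message. This is only an illustration; whether the real code starts from a `HashSet`, and how commit a81db5d actually orders the list, are assumptions, and both function names are hypothetical.

```rust
use std::collections::HashSet;

/// Turns an unordered set of names into a sorted list, so that anything built
/// from it is identical on every run.
fn deterministic_names(names: &HashSet<String>) -> Vec<String> {
    let mut sorted: Vec<String> = names.iter().cloned().collect();
    sorted.sort();
    sorted
}

/// Builds a message in the same shape as the vm_error text quoted in this thread.
fn circular_reference_message(names: &HashSet<String>) -> String {
    format!(
        "detected interdependent functions ({})",
        deterministic_names(names).join(", ")
    )
}
```

Sorting at the point where the error is created, rather than where it is printed, keeps every consumer of the list consistent, including snapshot tests.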

federico-stacks · 2025-11-10T08:29:39Z

For @jacinta-stacks and @francesco-stacks, who reviewed the PR while it was in draft: I’ve updated the PR description to reflect the final state of the work.

jacinta-stacks

Really clean. I like this a lot.

federico-stacks added 3 commits October 28, 2025 10:53

chore: make aac macros usable from other modules, stacks-network#6627

0db522b

chore: aac add clarity_types as test dependency for doc purpose, stacks-network#6627

1eead86

test: aac add consensus coverage for ParseError variants subset (11), stacks-network#6627

c01f81e

federico-stacks self-assigned this Oct 28, 2025

federico-stacks added aac Avoiding Accidental Consensus aac-testing Avoiding Accidental Consensus Testing Specific Task labels Oct 28, 2025

federico-stacks linked an issue Oct 28, 2025 that may be closed by this pull request

AAC Testing: Add test coverage for ParseError enum #6627

Open

federico-stacks requested review from francesco-stacks and jacinta-stacks October 28, 2025 10:36

jacinta-stacks reviewed Oct 28, 2025

jacinta-stacks reviewed Oct 28, 2025

federico-stacks added 2 commits October 30, 2025 10:29

refactor: aac consensus macro just produce a test body, stacks-network#6627

fe6a8d5

merge: address conflict with develop, stacks-network#6627

298959a

francesco-stacks reviewed Oct 30, 2025

federico-stacks added 3 commits October 30, 2025 18:05

crc: remove clarity_types dependency and use clarity crate aliases, stacks-network#6627

2ac068b

crc: clean contract code string literal, stacks-network#6627

730fc6d

chore: aac fix ParseError::CircularReference indeterminism, stacks-network#6627

a81db5d

federico-stacks force-pushed the chore/aac-parse-error-test branch from 27b1d56 to a81db5d Compare October 31, 2025 12:10

francesco-stacks mentioned this pull request Oct 31, 2025

chore: add consensus tests for MemoryBalanceExceeded #6642

Open

4 tasks

federico-stacks added 5 commits November 3, 2025 08:36

Merge branch 'develop' into chore/aac-parse-error-test

cac6d65

test: add unit tests for build_ast, stacks-network#6627

d28f8bc

test: improve parse consensus test and add VaryExpressionStackDepthTooDeep variant test, stacks-network#6627

83f08ad

test: add ExpectedWhitespace aac test, stacks-network#6627

1ea97d3

test: add UnexpectedToken aac test, stacks-network#6627

dc157e4

test: add NameTooLong aac test, stacks-network#6627

d9c6948

federico-stacks force-pushed the chore/aac-parse-error-test branch from e62cd5c to d9c6948 Compare November 6, 2025 07:29

federico-stacks added 12 commits November 6, 2025 09:38

chore: document variant_coverate_report, stacks-network#6627

a0f34a8

test: add InvalidPrincipalLiteral aac test, stacks-network#6627

a04618b

test: add InvalidBuffer as unreachable, stacks-network#6627

42f94d6

test: add ExpectedContractIdentifier aac test, stacks-network#6627

9e74b7c

test: add ExpectedTraitIdentifier aac test, stacks-network#6627

190f702

test: add TupleColonExpectedv2 aac test, stacks-network#6627

03368c7

test: add TupleCommaExpectedv2 aac test, stacks-network#6627

6145f1d

test: add TupleValueExpected aac test, stacks-network#6627

49935ab

test: add ContractNameTooLong aac test, stacks-network#6627

4b70a35

test: add IllegalASCIIString aac test, stacks-network#6627

5a1b8c3

test: add IllegalContractName aac as unreachable, stacks-network#6627

2924f6e

chore: fix failing unit tests, stacks-network#6627

8c0a7fc

federico-stacks force-pushed the chore/aac-parse-error-test branch from 1be51dc to 8c0a7fc Compare November 7, 2025 18:03

test: update insta for test_illegal_ascii_string, stacks-network#6627

7848a34

federico-stacks marked this pull request as ready for review November 10, 2025 08:24

federico-stacks requested review from a team as code owners November 10, 2025 08:24

federico-stacks requested review from aaronb-stacks, francesco-stacks and jacinta-stacks November 10, 2025 08:24

Merge branch 'develop' into chore/aac-parse-error-test

1057ea9

jacinta-stacks approved these changes Nov 11, 2025
